A new statistical excitation mapping for enhancement of throat microphone recordings
نویسندگان
چکیده
In this paper we investigate a new statistical excitation mapping technique to enhance throat-microphone speech using joint analysis of throatand acoustic-microphone recordings. In a recent study we employed source-filter decomposition to enhance spectral envelope of the throat-microphone recordings. In the source-filter decomposition framework we observed that the spectral envelope difference of the excitation signals of throatand acoustic-microphone recordings is an important source of the degradation in the throat-microphone voice quality. In this study we model spectral envelope difference of the excitation signals as a spectral tilt vector, and we propose a new phone-dependent GMM-based spectral tilt mapping scheme to enhance throat excitation signal. Experiments are performed to evaluate the proposed excitation mapping scheme in comparison with the state-of-the-art throat-microphone speech enhancement techniques using both objective and subjective evaluations. Objective evaluations are performed with the wideband perceptual evaluation of speech quality (ITU-PESQ) metric. Subjective evaluations are performed with the A/B pair comparison listening test. Both objective and subjective evaluations yield that the proposed statistical excitation mapping consistently delivers higher improvements than the statistical mapping of the spectral envelope to enhance the throat-microphone recordings.
منابع مشابه
Speaker-dependent mapping of source and system features for enhancement of throat microphone speech
A throat microphone (TM) produces speech which is perceptually poorer than that produced by a close speaking microphone (CSM) speech. Many attempts at improving the quality of TM speech have been made by mapping the features corresponding to the vocal tract system. These techniques are limited by the methods used to generate the excitation signal. In this paper a method to map the source (excit...
متن کاملAn analytic modeling approach to enhancing throat microphone speech commands for keyword spotting
This research was carried out on enhancing throat microphone speech for noise-robust speech keyword spotting. The enhancement was performed by mapping the log-energy in the Mel-frequency bands of throat microphone speech to those of the corresponding close-talk microphone speech. An analytic equation detection system, Eureqa, which can infer nonlinear relations directly from observed data, was ...
متن کاملFabrication and investigation of a transparent and flexible loudspeaker and microphone based on carbon nanotube
Transparent acoustic sensors and actuators are a new generation of acoustic transducers that can create an evolution in the microphone and loudspeakers industries. These transducers with properties like transparency, flexibility, flatness, very low weight and thickness have a great potential for various applications like public speakers, active noise cancelation systems, displays, cell phones a...
متن کاملMapping Speech Spectra from Throat Microphone to Close-Speaking Microphone: A Neural Network Approach
Speech recorded from a throat microphone is robust to the surrounding noise, but sounds unnatural unlike the speech recorded from a close-speaking microphone. This paper addresses the issue of improving the perceptual quality of the throat microphone speech by mapping the speech spectra from the throat microphone to the close-speaking microphone. A neural network model is used to capture the sp...
متن کاملشکلدهی وفقی و هوشمند پرتو در آرایههای میکروفونی Ad-hoc با استفاده از خوشهبندی و رتبهبندی میکروفونها
Considering the existence of a many speech degradation factors, speech enhancement has become an important topic in the field of speech processing. Beamforming is one of the well-known methods for improving the speech quality that is conventionally applied using regular (classical) microphone arrays. Due to the restrictions in the regular arrangement of microphones, in recent years there has be...
متن کامل